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USE OF GRAM-POSmVE BACTERIA 
TO EXPRESS RECOMBINANT PROTEINS 

BACKGROUND OF THE INVENTION 

1. Field of the Invention 

The present invention relates to the construction and use of a novel giam- 
positive expression vector system. Protein fusions containing the amino and 
carboxy sorting sequences of a gram-positive surface polypeptide can be 
anchored to the surface of a heterologous host. Alternatively, recombinant 
proteins may be secreted into the growth medium. This system can be used for 
overproducing and purifying recombinant proteins or peptides for any puipose, 
including but not limited to commercial scale production of diagnostic and 
vaccine antigens and therapeutic proteins, such as hormones and growth factors. 



10 



15 



2. Description of the Related An 

The ability to overproduce many prokaryotic and eukaryotic proteins has 
been made possible through the use of recombinant DNA technology. The 
introduction of chimeric DNA molecules into Escherichia coli has been the 
method of choice to express a variety of gene products. The main impetus 
behind the use of £. co//-based protein production systems is the host's shon 
generation time and well-developed genetics. Yet despite the development of 
20 many efficient E. coff-based gene expression systems in recent years, the most 
important concern continues to be that associated with downstream processing of 
the product. Recombinant proteins produced in E. coli do not readily cross the 
outer cell membrane (OM); as a result, polypeptides must be purified from the 
cytoplasm or periplasmic space (PS). Purification of proteins from these cellular 
compartments can be somewhat difficult. Frequenfly encountered problems 
include low product yields, contamination with potentially toxic cellular material 
(i.e., endotoxin) and the formation of large amounts of partially folded 
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polypepticfe chains in non-active aggregates, caUed inclusion bodies. As a result 
of these inherent piuification difficulties, a great deal of attention has recently 
been focused on the natural abiUty of many gram-positive bacteria to export 
proteins beyond their cell waU boundaries. In spite of the fact that the genetics 
of gram-positive bacteria are not as well-elucidated as those of E. coli, these 
microorganisms have, nevertheless, become recognized as more favorable 
candidates as hosts for the production of recombinant proteins. 

Proteins exported across the gram-positive cytoplasmic membrane (CM) 
generally have two fates: they are either released (secreted) into the extracellular 
milieu or they remain anchored to the ceU wall/membiane. Many of the recently 
developed gram-positive based expression systems have reUed exclusively on the 
former route of protein export (i.e., secretion). The basic strategy for directing 
the secretion of proteins in gram-positive expression systems involves fusing 
target proteins with functional N-terminal signal sequences of tiie gram-positive 
secretion systems currently available. By far, tite most popular hosts belong to 
tiie genus Bacillus. This group of microorganisms has been used by industry for 
tiie production of a variety of economically important proteins (Priest, Bacterial. 
Rev., 41:711 (1977); Glenn, Am. Rev. Microbiol, 30:41 (1976)). Perhaps the 
most extensively studied gram-positive host-vector expression system is based on 
the B. amyloliquefaciens alpha-amylase gene (Ulmanen et al. , J. Bacteriol. , 
162:176 (1985); Palva et ai, Proc. Natl. Acad. Sci. U.S.A., 79:5582 (1982); 
Palva et al. , Gene, 22:229 (1983); Lundstrom et al. , Virus Res., 2:69 (1985)). 
However, expression systems based on otiier gram-positive hosts such as B. 
subtilis, B. coagulans, B. lichenifomis (U.S. Patents 4,824,782, 5,171,673, 
25 4,711,843; WO patent 8,605,812; Chang, S., Methods in Enzymology, ISZ-.SQl- 
516 (1987)), Lactococcus and Lactobacillus spp. (U.S. Patent 5,242,821), 
Staphylococcus spp. (Abrahmsen et al., EMBO J. 4:3901-3906 (1986)), 
Streptomyces spp. (U.S. Patem 4,745,056; JP Patent 2,002,379; EP 352,707) 
and Corynebacterium spp. (U.S. Patent 4,965,197; WO Patem 9,303,158) have 
30 also been developed for general use. 
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Gram-positive secretion systems provide several advantages over 
traditional E. coli based expression systems. One obvious benefit is that proteins 
exported beyond the cell waU usually retain tiieir native conformation. As a 
result, one can take foil advantage of established purification protocols which are 
5 based on the fonctional properties of the active protein. In addition, giam- 
positive expression systems usually generate higher protein yields which are 
generally free of potentially toxic contaminating cellular material. 

There are, however, several practical limitations to the purification of 
exttacellular proteins. The first concern is that of protein instability. 

10 Recombinant proteins secreted by many gram-positive hosts are extremely 

sensitive to proteolytic degradation by host-encoded extracellular proteases. The 
presence of such proteases can drastically affect protein yield. Fortunately, 
significant improvements have been achieved in maintaining the viability of 
secreted proteins by genetically modifying host strains with reduced extracellular 

15 protease activities (Kawamura et al , J. Bacteriol. , 160:442 (1984); U.S. Patent 
5,084,383). 

Another important point to consider when using gram-positive secretion 
systems is the requirement for extensive downsti^m processing. It is widely 
accepted that the purification of recombinant proteins from bulky, large volume 

20 fermentations is an exti-emely time consuming and costiy proposition, bofli in 
terms of equipment and manpower. In those cases where tiie use of a gram- 
positive secretion system is simply not practical, it is common to resort to a less 
expensive, gram-negative based expression system. The decision to use E. coli 
as an alternative expression host, despite the potential pitfalls, is made simply on 

25 tfie basis that recombinant polypeptides remain associated with the bacterial cell 
(i.e., intracellular or periplasmic) and hence are generally easier to purify. 
Therefore, it is clear that what is needed in tills art is a bacterial expression 
system that incorporates the salient features of botii gram-positive (i.e., protein 
secretion) and gram-negative (i.e., protein compartmentalization) systems. A 

30 novel, but simple alternative to the currently available bacterial expression 
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systems would be the development of a gram-positive system that is able to 
specifically anchor or attach recombinant proteins directly to the cell wall 
surface. In this way the anchoring process can become an integral part of the 
purification process. For example, cells harboring a recombinant protein 
attached to the cell surface would be washed, collected, resuspended in a small 
volume and then treated with a specific agent to affect the release of die desired 
protein. Following removal of the bacterial cells, the resulting supernatant fluid 
would be highly enriched for the protein product. 

The cell wall of gram-positive bacteria is a complex organelle, which is 
assembled from peptidoglycans, carbohydrates, and proteins with different 
biological properties. Surface proteins differ from naturally secreted products in 
that the former require specific sorting signals that presumably aUow them to 
deviate from the normal default pathway of protein export. Despite the fact that 
many of the biologically important gram-positive proteins, including the 
streptococcal M protein, and the protein A and fibronectin binding proteins of S. 
aureus, are anchored at the cell surface, the cell wall of giam-positive bacteria 
remains a relatively unknown cellular compartment. 

While the normal N-terminal export signals that serve as the basis for 
bacterial secretion systems have been extensively characterized (Abrahmsen et 
al , EMBO J. , 4:3901-3906 (1985); Heijne and Abrahmsen, FEBS Letters, 
244:439-446 (1989); Pugsley, Microbio. Reviews, 57:50-108 (1993)), the m- 
acting sorting signals that orchestrate the anchoring of cellular proteins to the 
giam-positive cell wall have been only recentiy elucidated. Sequence alignment 
of well over 50 gram-positive surface proteins to date (excluding those from B. 
subtilis and S. pneumoniae) has revealed the existence of both amino- and 
carboxy-terminal secretion/anchoring signals. As expected, aU of the proteins 
examined contain an N-terminal signal sequence that presumably directs them to 
the ceUular export machinery. The C-termini of these proteins contain a 
predominantly hydrophobic, potential membrane spanning region that is 
immediately followed by a charged tail (Fischetti et al., Mol. Microbiol., 4:1603- 
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ms; Figure 1: Darkly shaded region = LP(X)TG(X) consensus sequence; 
lightly shaded region = carboxy-terminal hydrophobic, potential membrane' 
spanning region; residues in bold = charged protein tail; Protein A = 
Staphylococcus aureus protein A; M6 = Streptococcus pyogenes M6 protein; 
WapA = S. mutans wall-associated protein A; M49 = S. pyogenes M49 protlin; 
IgA-BP = streptococcal IgA-binding protein; Protein G = streptococcal protein 
G; Fn-BP = staphylococcal fibronectin-binding protein; T6 = streptococcal T6 
protein; Pac = S. mutans surface protein; Wg2 = S. cremoris surface protease). 
Preceding the hydrophobic domain is a proUne/glycine-rich region which has 
been predicted to span the peptidoglycan cell wall in a beta-sheet-like 
conformation. Within this region, aU of die surfece proteins examined have a 
nearly 100% conserved hexapeptide with the consensus LP(X)TG(X), where X is 
usuaUy a Thr or Ser and sometimes a Gly, Lys, or Asn (Figure 1). The 
conservation of the C-terrainal sequence elements (i.e., LP(X)TG(X) motif, 
hydrophobic membrane spanning region and charged tail) suggests that the 
process of sorting and anchoring polypeptides to the bacterial ceU wall is shared 
by many gram-positive species (Fischetti et al, Mol. Microbiol, 4:1603-1605 
(1990); Fischetti etal, Curr. Opinion Biotechnol. , 4:603-610 (1993)). 

The functional importance of the carboxy-terminal sorting elements 
identified by sequence alignment analysis has been confirmed using protein A 
(Moks et al. , Eur. J. Biochem. , 156:637-643 (1986)), a well-characterized 
surface protein of S. aureus, as a model system. Protein A, a single polypeptide 
chain, contains two major functional domains. The N-terminal domain contains 
a 36 amino acid signal peptide sequence followed by several repeated sequence 
modules (E, D, A, B, and C) that have immunoglobulin binding activity. The 
C-terminal domain contains conserved sorting signals which are thought to be 
responsible for anchoring protein A to the cell wall (Guss et al., Eur. J. 
Biochem. , 138:413-420 (1984); Guss et al. , Eur. J. Biochem. , 143:685 (1984); 
Fischetti et al,Mol. Microbiol., 4:1603-1608 (1990)). Using deletion analysis, 
Schneewind et al. (Cell, 70:267-281 (1992)) confirmed that the proper sorting of 
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protein A in S. aureus requires all three of the conserved sequence elements 
(i.e., -LPETGE-motif, C-terminal hydrophobic domain, and charged tail). Based 
on their results, Schneewind et al. {Cell, 70:267-281 (1992)) have proposed the 
following sequence of events in the anchoring of protein A and presumably many 
other gram-positive proteins to the cell surface. First, the N-terminal signal 
sequence of protein A directs the polypeptide into the general secretion pathway 
(GSP) (for reviews, see Pugsley, Microbio. Reviews, 57:50-108 (1993); Salmond 
and Reeves, TIBS 18:7-12 (1993)). During the process of membrane ' 
translocation, the main function of the C-terminal charged tail is to prevem 
secretion of the protein into the medium. Translocation of protein A across the 
membrane presumably results in the recognition of the C-terminal sorting signal. 
Following recognition, the LPETGE (LPXTGX) motif is cleaved specifically 
between the threonine (T) and glycine (G) residues and the resulting N-terminal 
protein fragment is then covalently linked to the S. aureus cell wall (Navarre and 
15 Schneewind, Mo/. M/crofoo/., 14:115-121 (1994)). 

Because the anchor motif of gram-positive surface proteins is conserved 
among a wide variety of molecules and several different gram-positive species, a 
logical question to ask is whether it would be possible to anchor a weU-known 
surface protein to the cell wall of a heterologous gram-positive host. Pozzi et al. 
20 {Res. Microbiol , 143:449-457 (1992)), using the fibrillar streptococcal M 
protein, were the first to address the question of protein anchoring in 
heterologous gram-positive hosts. Structurally, the M protein consists of an 
extended central alpha-helical coiled-coil rod flanked by functional end domains 
(Figure 2) (Phillips et al.,Proc. Natl. Acad. Set. (USA), 78:4689-4693 (1981); 
Fischetti, C//«. Microbiol. Reviews, 2:285-314 (1989))(Figure 2, (A) Structural 
representation of the M6 protein; (B) Linear representation of the M6 protein 
molecule. The map displays the N-terminal signal peptide (S), several repeated 
domains (A, B, C, and D), the proline/glycine rich region that contains the 
LPSTGE motif (LPXTGX), the C-terminal hydrophobic, potential membrane 
spanning domain (black bar), and charged tail (KRKEEN). Amino and carboxy 
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-Sfminai residueTimportant for export and cell wall sorting/anchoring of the M6 
protein are shaded. The anchor region is not drawn to scale.). The N-terminus 
contains a 42-residue signal sequence required for proper processing (Fischetti, 
CUn. Microbiol. Reviews, 2:285-314 (1989); Hollingshead et al.,J. Biol Chem 
261:1677-1686 (1986)), whUe the C-terminus, as determined by sequence 
comparisons, contains the conserved residues that are necessary for cell sorting 
and anchoring (Fischetti et a.. Mot. Microbiol, 4:1603-1605 (1990); PanchoU 
and Fischetti, J. BaaerioL, 170:2618-2624 (1988); Figure 1). After 'integration 
of a promoterless emm-6.1 gene in the chromosome of the human oral 
commensal S. gordonii, it was demonstrated that the M6 protein was correctiy 
localized to the cell wall of the heterologous host (Pozzi et al. , Res. Microbiol. , 
143:449-457 (1992)). 

The next logical step was to examine the possibiUty of exploiting the M6 
sorting signals in order to express recombinant proteins on the surface of a gram- 
positive host. The M6 protein has been successfully modified to deliver several 
heterologous antigens, such as sequences derived from the human papilloma virus 
type 16 E7 protein, the immunodominant epitope of HIV-1 gpl20 and the major 
allergen of white-face hornet venom, to the surface of S. gordonii (Pozzi et al. , 
Infect. Immun. , 60:1902-1907 (1992); Pozzi et al. , Vaccine, 12:1071-1077 
(1994); Medaglini et al. , Proc. Natl. Acad. Sci. (USA), in press (1995)). Gram- 
positive oral commensal bacteria expressing recombinant fusion proteins on their 
cell surfece have been used to eUcit both a mucosal and systemic immune 
response to foreign antigens while colonizing the oropharynx (Fischetti et al, 
Curr. Opinion Biotechnol, 4:603-610 (1993); Medaglini etal, Proc. Natl ' 
25 Acad. Sci. (USA), in press (1995)). 

The recent results of the M6 experiments are important for two main 
reasons. First, they serve to confirm the hypothesis that the mechanism of 
protein sorting and anchoring is an extremely well conserved, if not universal, 
process in gram-positive bacteria. More importantly, these experiments are 
evidence for proof of the concept that it is possible to exploit cell wall protein 
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-sortinr^^Ae purpose of constructing novel expression systems that 
anchor recombinant proteins to the surface of a gram-positive bacterial host 
Therefore, in view of the aforementioned deficiencies attendant with prior art 
expression systems, it should be apparent that there stiU exists a need in the art 
for the construction and use of a unique gram-positive protein expression system. 

SUMMARY OF THE INVENTION 

Accordingly, a major objective of the present invention is to develop a 
gram-positive bacterial expression system that can be used for fte production and 
purification of recombinant proteins. 

It is also the objective of this invention to provide, as a specific example 
the construction and use of the SPEX system (for Streptococcal protein 
expression), which is based on the streptococcal M6 protein. This system will 
allow one femiliar in the art of genetic manipulation and protein biochemistry to 
express and purify recombinant proteins from gram-positive hosts, such as the 
15 human oral commensal 5. gordonii. 

More specifically, the present invention provides a method for expressing 
proteins in gram-positive bacteria, comprising selecting a heterologous bacterial 
host which is gram-positive, deficiem in extraceUular protease production and 
capable of anchoring surface proteins via C-terminal sorting signals; introducing 
into the host a plasmid vector comprising DNA fragments encoding amino and 
carboxy terminal sorting sequences of a gram-positive cell wall surface protein 
DNA fragments coding for the desired protein product, and transcriptional and' 
translational control sequences that firnction in the chosen host; expressing a 
fusion protein comprising the desired protein product, an N-terminal signal 
sequence, and a C-terminal sorting sequence wherein the expressed protein is 
anchored to the host's cell surface; cleaving the anchored protein from the host's 
cell surface using a protease; and purifying the cleaved protein from the 
supernatant fluid. The present invention also provides an alternative method for 
expressmg proteins in a gram-positive host whereby the desired protein product 
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is secreted into the extracellular space, and for purifying the secreted proteins 
from the supernatant fluid. 

Briefly, the present invention features general methods for the 
construction and use of a gram-positive host/vector system that uses the cell 
sorting signals of surface proteins for targeted attachment to the cell waU. To 
demonstrate the utility of this system, the invention also describes the 
construction of several specialized vectors, designated SPEX, which are based on 
the S. pyogenes M6 protein. These vectors are intended to be used for the 
expression of proteins and peptides in gram-positive bacteria such as the human 
oral commensal, S. gordonii. A variety of recombinant protein products can be 
expressed by selecting die appropriate SPEX cloning vector. Several protein 
configurations are possible such as the presence of a combination of polyhistidine 
tag and protein A IgG binding domains, factor Xa or TEV protease cleavage 
sites, and a cell wall anchoring domain LP(X)TG(X). Each of the cloning 
vectors acts as a shutfle plasmid which permits the initial cloning to be performed 
in E. coli, followed by transformation into a gram-positive host, such as the 
naturally competent S. gordonii V288 (ChalUs). Depending on the SPEX vector 
used, recombinant proteins can be purified from an anchored state after cleavage 
with a specific protease or, alternatively, from the culture medium. 

With the foregoing and other objectives, advantages and features of tiie 
invention that will become hereinafter apparent, the nature of tiie invention may 
be more clearly understood by reference to the following detailed description of 
the preferred embodiments of the invention and to the appended claims. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 depicts the alignment of the C-terminal residues from a select 
group of gram-positive surface proteins. 

Figure 2 depicts the ftinctional domains of the S. pyogenes M6 protein. 
Figure 3 depicts the nucleotide sequence of the S. pyogenes emmS gene 
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and adjacent DNA regions. 

Figure 4 is a diagrammatic representation of the construction of SPEX 
vectors. The pBLUESCRIPT (ApO backbone of each plasmid is not shown. 

Figure 5 illustrates possible SPEX cassettes useful for anchored surface 
5 production and purification of recombinant proteins. 

Figure 6 iUustrates possible SPEX cassettes used for the secretion and 
purification of recombinant proteins. 

Figure 7 depicts some possible S. gordonii strains for integrated 
expression of M6 fusion proteins. 

0 Figure 8 depicts integration of the M6 fusions into the S. gordonii 

chromosome. 
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DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 

OF THE INVENTION 

More particularly, the present invention relates to general methods for the 
construction and use of a novel gram-positive expression system. Proteins 
produced by the expression system described herein can be purified either 
directly fi-om the cell surface or ft'om the supernatant fluid following secretion. 
For example, an expression system based on the well-known M6 protein of 
S. pyogenes is also described. This expression system, designated SPEX (for 
Streptococcal protein expression), uses the human oral commensal Streptococcus 
gordonii as a model host for this group of organisms. 



A. General Method 

The first major factor to be considered when constructing the gram- 
positive expression system is to determine which bacterium to use as a host. To 
25 be effective in the present invention the bacterial host should be a gram-positive 
bacteria tiiat is capable of anchoring surface proteins via C-terminal sorting 
signals. Such gram-positive bacteria include, but are not limited to, the genera 
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Aerococcus, Bifidobaaerium, Corprococcus, Deinobaaer, Deinococcus, 
Emerococcus, Gemella, Lactobacillus, Lactococcus, Leuconostoc, 
Mannococcus, Melissococcus, Micrococcus, Pediococcus, Peptococcus, 
Peptostreptococcus, Planococcus, Ruminococcus, Saccharococcus, Salinococcus, 
5 Carcina, Staphylococcus, Stomatococcus, Streptococcus, Trichococcus, and 
Vagococcus. To gain the maximum benefit from the present invention, the 
bacterial host should be deficient in extraceUular protease production. 

In the present invention the more preferred species to be used as an 
expression host is Streptococcus gordonii. The most preferred strain is a 
10 protease deficient variant of V288 (Challis). 

The second major consideration is the selection of the gram-positive ceU 
wall protein to be used as the basis for the construction of the expression vectors. 
Nearly all surface molecules of gram-positive bacteria have amino and carboxy 
terminal sorting signals that specifically target them for cell waU attachment 
15 (Fischetti et al. , Mol Microbiol. , 4: 1603-1605 (1990)). The expression system 
used in the practice of the present invention can be constructed with any gram- 
positive surface protein that contains the appropriate sorting signals. Such 
sorting signals can be readily identified by those skilled in the art. Sources of 
surfece proteins include, but are not limited to, such gram-positive genera as 
20 Aerococcus, Bifidobacterium, Corprococcus, Deinobaaer, Deinococcus, 

Emerococcus, Gemella, Lactobacillus, Lactococcus, Leuconostoc, Mannococcus, 
Melissococcus, Micrococcus, Pediococcus, Peptococcus, Peptostreptococcus, 
Planococcus, Ruminococcus, Saccharococcus, Salinococcus, Carcina, 
Staphylococcus, Stomatococcus, Streptococcus, Trichococcus, and Vagococcus. 

The more preferred protein to be used in the present invention is the type- 
6 M protein of Streptococcus pyogenes. The strucmral gene for the type 6-M 
protein from S. pyogenes has been cloned (Scott and Fischetti, Science, 221:758- 
760 (1983)) and the complete nucleotide sequence has been determined 
(Hollingshead er al. , J. Biol. Chetn. 261:1677-1686 (1986)). This sequence is 
30 shown in Figure 3. The predicted M6 amino acid residues are shown below the 
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-DNA sequence. The putative ribosome binding site (RBS) is boxed and the 
signal peptide H2 to -1) is underlined. The mature polypeptide represents 
residues +1 to 441. The proline/glycine rich ceU wall domain is shown as 
unshaded rectangles. The LPXTGX (LPSTGE) motif is shown in bold letters. 
The adjacent Ughtly shaded rectangle represents the hydrophobic, membrane 
spanning domain and the daridy shaded rectangle represents the C-terminal 
charged tail. Relevant restriction sites are shown above the DNA sequence. A 
transcriptional terminator is shown as the inverted repeat structure denoted by the 
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The next consideration is to isolate and characterize the DNA sequence 
that codes for the cell wall protein in the chosen host. Methods used for the 
construction and screening of genomic libraries (i.e., plasmid, cosmid, phage or 
YAC) to isolate the appropriate protein coding sequence and techniques for 
determining its nucleotide sequence are known to those skilled in the art of 
genetic engineering. See, e.g., Sambrook, Fritsch & Maniatis, Molecular 
Cloning: A Laboratory Manual, Second Edition (1989); Maniatis, Fritsch, & 
Sambrook, Molecular Cloning: A Laboratory Manual (1982) and Griffin & 
Griffin, PCR Technology: Current Innovations (1994). 

The DNA Iragments diat encode the amino and carboxy terminal sorting 
signals of the surface protein can be isolated using recombinant DNA techniques 
familiar to those skilled in the art. These fragments can be assembled on an 
E. coli cloning vector such as pUC18/19 or any other appropriate plasmid 
element. The N-terminal signal should include a complete leader sequence as 
well as a number of amino acid residues from the N-terminus of the mature 
protein. The inclusion of these sequences will insure proper targeting to the 
cellular export machinery and correct processing during the actual translocation 
process. The C-terminal sorting (i.e., anchor) signals should include the 
LPXTGX motif (where X is any amino acid residue), a hydrophobic membrane 
spanning domain and a charged tail. A comparison of such sorting sequences is 
shown in Figure 1. The darkly shaded sequences on the left side of the 
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aUgnment represents the LP(X)TG(X) consensus sequence and the proteins were 
aligned from this sequence. The lightly shaded region on the right side of the 
alignment represents the carboxyl-terminal hydrophobic, potential membrane 
spanning region. Residues shown in bold represent the charged protein tail. 

To faciHtate the construction of heterologous fusion proteins, a small 
DNA fragment containing the recognition sequences for several restriction 
endonucleases can be placed between the DNA sequences that encode the amino 
and carboxy terminal sorting signals. This multiple cloning site (MCS) wiU 
facilitate the insertion of heterologous protein coding sequences and upon 
expression will generate in-fiame protein fiisions that are anchored to the gram- 
positive cell wall surface. Alternatively, expression vectors can also be 
constructed that lack the C-terminal anchor signals. In this case, the protein 
fusions will not be targeted for anchoring but, instead, will be secreted into the 
extracellular space. 

15 The vectors described in the present invention will allow one familiar in 

the art to produce recombinant proteins that are either anchored at the cell 
surface or secreted into the extracellular space. Regardless of the final 
destination of the target protein, all of the vectors contain the M6 ribosome 
binding site (RBS), translation initiation codon (ATG) and the 42 amino acid 

20 leader sequence (-42 to -1) (Figure 3, 4 and 5). In addition, all of the vectors 
contain some portion of the mature N-terminus of the M6 protein. Previous 
studies have demonstrated that as many as 122 amino acids of the mature M 
protein N-terminus are also required for successful translocation. However, 
more recently, it has been shown that as few as 5 N-terminal amino acid residues 

25 of the mature protein may be required for proper processing and translocation of 
a M6 fusion protein to the cell surface (Fischetti et al., Curr. Opinion 
Biotechnol. 4:603-610 (1993)). The preferred number of M6 amino terminal 
residues to be used in the practice of the present invention is 58 and include 
residues -42 to +16 (Figure 3). 

30 The preferred C-terminai sorting signals used in the present invention are 
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also aenved trom the M protein of type-6 5. pyogenes. The vectors designed for 
targeted protein anchoring contain the M6 C-terminal residues 302 to 441, 
termination codon (TAA) and downstream sequences, which include a putative 
transcriptional terminator (Figure 3). Vectors that are designed for protein 
secretion do not contain the M6 C-terminal sorting signals. 

The expression vector systems of the present invention are suitable for 
overproducing and purifying any desired protein. At the very least, the 
expression vector systems of the present invention are useM for overproducing 
and purifying recombinant proteins or peptides. These proteins or peptides may 
be used for any purpose; for example, they may be used to generate diagnostic 
and vaccine antigens and therapeutic proteins, such as hormones and growth 
factors, and the like. 

To facilitate the release of anchored fiision proteins from the gram- 
positive cell wall, cleavage sites for various proteolytic enzymes can be 
engineered into the expression vectors. These unique sequences can optionally 
be incorporated into the vectors so that they reside either immediately 
downstream of the N-terminal signal peptide or immediately upstream of the C- 
terminal sorting signal domain. It is also possible to engineer cleavage sites at 
both locations. Proteases that may be suitable include, but are not limited to, the 
proteinase of tobacco etch virus (TEV), thrombin, enterokinase, and factor Xa. 

A variably sized protein spacer or linker region that physically displaces 
the fusion protein away from the cell surface may also be incorporated into the 
expression vectors. It is possible that in the absence of these spacers the 
efficiency of protease cleavage may be reduced due to steric hinderance with the 
ceU wall surface. These linker sequences can be incorporated between the 
protease recognition sequence and the C-terminaJ sorting signal domain. 

Any unique protease cleavage site can be incorporated into the vectors of 
the present invention. Preferably, the protease cleavage site will be greater than 
4 amino acid residues. The most preferred protease for use with die present 
invention is the TEV NIa proteinase. The TEV NIa proteinase cleaves a specific 
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consensus cleavage site which spans the seven amino acid sequence E-X-V/I/L- 
Y-X-Q*S/G (X can be any amino acid residue; Doughtery et al, EMBO J., 
7:1281-1287 (1988)). The preferred cleavage site for the present invention is E- 
N-L-Y-F-Q*G (Parks et al,Anal. Biockem. 216:413-417 (1994)). TEV 
cleavage sites can be engineered anywhere along the fusion construct; however, 
the preferred location is immediately adjacent to the proline/glycine rich region' 
harboring the LPXTGX motif (see Figure 5: RBS = M6 ribosome binding site; 
ATG = initiation codon; MCS = multiple cloning site (MCS); black bar = 
membrane spanning domain; closed triangles = engineered cleavage site for 
TEV (tobacco etch virus) NIa protease; open triangle = cleavage site for factor 
Xa (ffiGR/); small vertical arrow 16 = putative cellular signal peptidase 
cleavage site; see SPEXlb and SPEXlc). The addition of a protein spacer region 
between the TEV recognition sequence and the glycine/prolin^rich anchor region 
may improve TEV NIa proteinase efficiency. The preferred spacer for the 
present invention is derived from the M6 protein and contains amino acid 
residues 222 to 301 (Figure 5; SPEXld). 

Following protease treatment, the released proteins can be purified in a 
variety of ways known in the art. Suitable purification method include, but are 
not Hmited to, dialysis, ultrafiltration, zonal centrifugation, molecular-exclusion 
chromatography, isoelectric precipitation, solvent fractionation, electrophoretic 
separation, thermal precipitation, ion-exchange chromatography, affmity 
chromatography, and tiie like. A single step purification using affinity tags is 
preferred. These affinity tags, like the protease cleavage sites, can be optionally 
engineered at either the amino or carboxy terminal regions of the fusion protein. 
Useful affinity tags include, but are not Umited to, a polyhistidine tract 
(HHHHHH), the IgG binding domain of protein A and glutathione S-transferase 
(GST). Recombinant proteins released firom the surface of a gram-positive host 
can be easily purified in a one step process using metal chelation (e.g., Ni- 
agarose), protein A-Sepharose, and glutatiiione-Sepharose column 
chromatography. N-terminal signal and affinity tag sequences can be easily 
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renroved by incoiporating a protease cleavage site that is different from the 
recognition sequence used to remove the protein from the cell surface. 

The preferred affinity tag for the present invention is a consecutive stretch 
of 6 to 10 histidine residues (HHHHHH). A polyhistidine tag of six amino acid 
residues has been shown to be poorly immunogenic and rarely affects protein 
function and structm-e. The polyhistidine affinity tag can be engineered at either 
the amino or carboxy terminus of the protein. For the present invention, the 
preferred site is immediately downsfream of the N-terminal M6 sequence's 
(Figure 5). 

The N-terminal M6 and affinity tag sequences can be removed from the 
purified proteins by engineering in a second protease cleavage site. The 
preferred enzyme for the present invention is factor Xa. The preferred location 
is immediately downstream of the afifinity tag (Figure 5). 

In order to efficienUy express the recombinant gene which encodes the 
fusion protein, regulatory sequences should preferably be included that assure 
adequate transcription and translation. These vectors should contain 
transcriptional (i.e., promoters) and translational (i.e., ribosome binding site) 
control sequences that function in the chosen gram-positive host. 

A number of promoters may optionally be used in the practice of the 
presem invention. These regulatory sequences, which are known to those skiUed 
in the art, include heterologous promoters from both gram-negative and gram- 
positive bacteria. These regulatory sequences can cause the expression of a gene 
to be turned on or off in response to a chemical or physical stimulus. 
Alternatively, constitutively expressed promoter elements can also be used. 

Gene dosage is a variable that has been shown to possibly have an effect 
on protein production in several bacterial expression systems. In general, genes 
present on multiple copy plasmids generate higher levels of protein. The' 
plasmids used for the expression of recombinant proteins in the present invention 
should be capable of stably replicating in the chosen gram-positive host. These 
plasmids, which replicate at either a high or low copy number, are familiar to 
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those skilled in the art. 

In the present invention, the extrachromosomal elements used in plasmid- 
linked expression of recombinant genes may optionally contain appropriate 
genetic markers for selection purposes. Suitable genetic markers include various 
antibiotic and heavy metal resistance genes as weU as nutritional determinants. 
However, other genes that provide a selective advantage can also be used. 
Ideally the selective markers should be capable of being expressed in both E. coli 
and gram-positive hosts. This would allow one to carry out the initial cloning in 
E. coli with subsequent transfer of the recombinant plasmid to a gram-positive 
host. Furthermore, in order to faciUtate the insertion of heterologous DNA 
fragments, plasmids should also contain an in-frame polylinker or MCS which 
includes useful restriction endonuclease cleavage sites. 

Techniques for introducing recombinant plasmids into nontransformable 
hosts are familiar to tiiose skilled in the art. These methods include, but are not 
15 limited to, electroporation (Dower, in Genetic Engineering-Principles and 
Methods Vol.12, pp.275-296 (1990)), chemical transformation, conjugation, 
transduction, and the like. Recombinant DNAs can be easily introduced into 
those gram-positive species that are naturally competent by transformation. 

The presence of certain genes on multiple copy plasmids can occasionally 
cause toxicity and, in some cases, cell death. The toxic effects of these gene 
products can many rimes be reduced by simply decreasing the copy number of 
the determinant to be expressed. This can be readily achieved by stably 
integrating the gene encoding tfie fusion product directiy into the host 
chromosome. The genetic methods that are commonly used to create single copy 
prokaryotic expression modules involve homologous, site specific and illegitimate 
(transposition) recombination. 

The preferred method of gene expression for the present invention 
involves integration of the recombinant SPEX construct into the S. gordonii 
chromosome. S. gordonii strains that are useful as expression hosts include 
GP230 (Pozzi et al, Res. Microbiol. 143:449-457 (1992)), GP232 (Pozzi et al., 
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Infect, immm. . 60:1902-1907 (1992)) and GP251 (Pozzi et al. , Vaccine, 
12:1071-1077 (1994); Figure 7: enm-6 is the structural gene for the S. pyogenes 
type 6 M protein. The emC and cat genes encode erythromycin and 
chloramphenicol resistance, respectively. Thin lines flanking the inserted 
(boxed) sequences represents the S. gordomi chromosome. The integrated 
sequences are located downstream of a strong chromosomal S. gordonii promoter 
(P)). These strains contain genetic cassettes that have been integrated 
downstream of a strong S. gordonii promoter via homologous recombination. In 
the present invention, these cassettes provide flanking regions of homology that 
will allow one skilled in the art to replace the sequences presem on the GP230, 
GP232 and GP251 with the recombinant SPEX plasmids constructed in vivo. 

The following examples are presented in order to more fully illustrate the 
preferred embodiments of the invention. They should in no way be construed, 
however, as limiting the broad scope of the invention. 

EXAMPLE 1: Construction of SPEX plasmids 
A. Anchored vectors 

Plasmid pVMB20 (Pozzi eraL, Infect. Immun. 60:1902-1907 (1992)) was 
constructed by subcloning a 3.4 kb CM fragment, which contains a promoterless 
emm-6 gene and a gene encoding erythromycin resistance (emC; Horinouchi and 
Weisblum., J. Bacterial. 150:804-814 (1982)), from pVMB3 (Pozzi et al. Res 
Microbiol 143:449-457 (1992)) into the plasmid pBLUESCRIFT. A 545 bp 
interior region of the en,m-6 gene was removed by KpnJ and HinDllI digestion 
and replaced with a small DNA fragment harboring a synthetic multiple cloning 
site (MCS). The resultmg plasmid, designated pSMB55 (Figure 4), contains 
sequences encoding the first 122 N-terminal and the last 140 C-terminal amino 
acids of the M6 protein. DNA fragments encoding heterologous protein 
sequences are inserted into the MCS of pSPEXla or pSPEX2a to form in-frame 
fusions with the M6 N-terminal and/or C-terminal sequences. 
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1 he aacl-Hpal fragment from within the ermC determinant of pSMB55 was 
removed and replaced with a fragment harboring the aphlll gene encoding 
kanamycin resistance (Trieu-Cuot and Courvalin, Gene, 23:331-341 (1983)). 
The resulting plasmid was designated pSMB113 (Figure 4). 

Plasmid PSMB104 carries the gene encoding for M6^,„. a 

protein containing the first 16 N-terminal and the last 220 C-terminal amino acids 
(Pozzi, G., unpublished data). The Clal-Kpnl fragment of pSMB104 was used to 
replace the existing Oal-Kpnl fragment of pSMB113. The resulting plasmid 
vector, designated pSPEXla (Figure 4). can be used to construct protein fusions 
that contain the N-terminal 58 and C-terminal 140 amino acids of the M6 
protein. These protein fusions, when expressed in an appropriate gram-positive 
host, will be anchored to the bacterial cell surface (Figure 5). 

B. Secretion vectors 

Deletion of the HinJim-Sacl fragment of pSPEXla effectively removes 
the C-terminal sorting signals of the M6 protein. The resulting plasmid, 
designated pSPEX2a, retains the M6 N-terminal 58 amino add residues ' 
necessary for protein translocation and processing (Figure 4). As a result, 
protein fusions generated with pSPEX2a will be secreted into the growth 
medium. TEV proteinase cleavage sites and polyhistidine tags can be 
incorporated to faciUtate the removal of non-essential protein sequences and for 
purification purposes, respectively (Figure 6: RBS = M6 ribosome binding site; 
ATG = initiation codon; Term = termination codon; triangle = engineered ' 
cleavage site for TEV (tobacco etch virus) NIa proteinase; shaded residues = 
important for proper processing and secretion of M6; small vertical arrow = 
putative cellular signal peptidase cleavage site). 
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EXSMWJnZ. Expression of recombinant genes in S. gordonU 

In the present invention, recombinant genes encoding M6 fusion proteins 
can be expressed ftom multicopy plasmids. Since the preferred expression host 
in the present invention is S. gordonii, plasmids containing an origin of 
replication (oriV) that functions in a variety of streptococcal species can be used 
(Horodniceanu et al. , Antimicrob. Agents Chemother., 10:795-801 (1976); 
Clewell et al, J. Bacterial., 117:283-289 (1974); Behnke and Ferretti, Pllsmid, 
4:130-138 (1980); Macrina etal., Infect. Immun. 28:692-699 (1980); Macrina et 
al.. Gene 19:345-353 (1982)). Alternatively, recombinant genes can be 
expressed in the form of single copies which have been stably inserted into the 
S. gordonii chromosome. 

The recombinant plasmid is naturally linearized during transformation 
(Pozzi et al., Res. Microbiol., 141:659-670 (1990); Pozzi et al., Irtfect. Inrnm 
60:1902-1907 (1992)) and recognition of the flanking homologous segments 
facilitates integration of the M6 gene fiision, together with the aphlll gene 
(Figure 8: the cat gene presem in the S. gordonii recipient strain GP251 confers 
chloramphenicol resistance). Recombinant S. gordonii are selected for 
kanamycin resistance (conferred by aphlll) using the "multUayer" plating 
technique as previously described by Pozzi et al. {FEMS Microbiol. Lett. , 48: 
189-194 (1987)). Kanamycin-resistant transformants are then scored for loss of 
either erythromycin (conferred by erniQ or chloramphenicol (confeired by cat) 
resistance (depending on the recipient). Antibiotics are used at the following 
concentrations: erythromycin, 5 ^g/ml, chloramphenicol, 5 /xg/ml, and 
kanamycin, 500 ^g/ml. Southern blot (Southern, /. Mol. Biol, 98:503 (1975); 
Southern, Methods Enzymol, 69:152 (1980)) or PGR analysis of purified 
genomic DNA of Km'(Cm' or Em') transformants can be used to establish the 
structure and copy number of the expression cassette in the bacterial 
chromosome. 

Production of the recombinant protein can be determined by Western blot 
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— anaiysis-orwhoi^tl extracts ('I'owbin et al, Proc. Natl. Acad. Sci. (USA), 
76:4350 (1979); Towbin and Gordon, /. Immunol., 72:313 (1984)). Surface 
expression of the M6 fusion protein can be tested by immunofluorescence 
according to the procedure described previously (Pozzi et al.. Vaccine. 12:1071- 
1077 (1994)). Recombinant proteins can be detected using polyclonal antisera 
against the entire M6 molecule, monoclonal antibodies directed against specific 
M6 epitopes (Jones et al. . J. Exp. Med. , 164:1226-1238 (1986); Jones and 
Fischetti, /. Exp. Med.. 167:1114-1123 (1988)) or antisera directed against the 
heterologous protein expressed as a part of the fusion. 

EXAMPLE 3. Purification of recombinant proteins 
from 5. gordonii 

Recombinant S. gordonii are grown in an appropriate Uquid medium such 
as Todd Hewitt-Yeast Extract Broth (THYEB), brain heart infiision (BHI) or 
trypticase soy broth (TSB) containing kanamycin at 37°C for 24 hours or until 
stationary phase growth is achieved. The bacterial culture is centrifuged and 
depending on the location of the recombinant protein (cell-wall anchored or 
secreted) the ceU peUet or the spent growth medium is retained for further 
analysis. 

S. gordonii cells harboring surface expressed proteins are collected, 
washed and resuspended in a minimal volume of PBS (150 mM NaCl, 16 mM 
Na,HP0„ 4mM NaH,PO, pH 7.3), in the presence of proteinase inhibitors. 

Recombinant proteins with engineered protease cleavage sites are removed 
from the cell surface by the addition of TEV NIa proteinase. Following 
incubation for a suitable period of time at 25-30°C, the cells are removed by 
centrifugation and the supernatant fluid, which is enriched for the recombinant 
protein, is collected. Those fusion proteins containing tandem histidine residues 
can be further purified by passing the supernatant over a nickel-chelating resin 
(ProBond, Invitrogen Corp.) or nickel nitrilotriacetic acid [Ni-NTA] agarose 
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(Qiagen Inc.). Following washing, the recombinant protein is then eluted from 
the column using an imidizole gradient under mild conditions (i.e., pH 6.0). The 
amount of purified protein can be measured by the method of Bradford 
(Bradford, Anal Biochem., 72:248-254 (1976)). 

The use of the SPEX2 series of vectors (Figure 6) generates fusion 
proteins that are secreted into the surrounding medium during growth. 
Following removal of the bacterial ceUs, the spem growth medium is retained 
and used as the starting material for purification of the desired product. 
Recombinant proteins can be purified using metal chelation chromatography, as 
described above, and non-essential N-terminal residues can be removed witii 
TEV NIa proteinase treatment. 

While the invention has been described and illustrated herein by 
references to various specific materials, procedures, and examples, it is 
understood that the invention is not resteicted to the particular combinations of 
materials and procedures selected for tiiat purpose. Numerous variations of such 
details can be impUed and will be appreciated by tiiose skilled in the art. 



wo 96/40943 

PCT/US96/09965 



-23- 

WHAT IS CLAIMED IS: 

1. A method for expressing a desired protein in gram-positive bacteria, 
comprising: 

selecting a heterologous bacterial host which is 
5 gram-positive, and 

capable of anchoring surface proteins via C-terminal sorting signals; 
introducing into the host a plasmid vector comprising 
DNA fragments encoding 

amino and carboxy terminal sorting sequences of a gram-positive 
^0 cell wall surface protein and 

the desired protein, and 

transcriptional and translational control sequences that function in the 
chosen host; 
expressing a fusion protein comprising 
^5 the desired protein, 

an N-terminal signal sequence, and 
a C-terminal sorting sequence 

wherein the expressed fusion protein is anchored to the host's cell 
surface; 

cleaving die anchored protein from the host's cell suriace using a 

protease; and 
recovering and purifying the cleaved protein. 

2. The method of claim 1 wherein the host is selected from the genus 
Aerococcus, Bifidobacterium, Corprococcus, Deinobacter, Deinococcus, 
25 Enterococcus, Gemella, Lactobacillus, Lactococcus, Leuconostoc, 

Marinococcus, Melissococcus, Micrococcus, Pediococcus, Peptococcus, 
Peptostreptococcus, Planococcus, Ruminococcus, Saccharococcus, 
Salinococciis, Carcina, Staphylococcus, Stomatococcus, Streptococcus, 
Trichococciis, or Vagococcus. 
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3. me method of claim 1 wherein the host is from the genus Streptococcus. 

4. The method of claim 1 wherein the host is Streptococcus gordonii. 

5. nie method of claim 1 wherein the host is Streptococcus gordonii V288 
(Challis), GP230, GP232, or GP251. 



5 6. The method of claim 1 wherein the host is deficient in extracellular protease 

production. 



7. The method of claim 1 wherein the gram-positive ceU wall surface protein is 
derived from the gtms Aerococcus, Bifidobacterium, Corprococcus, 
Deinobacter, Deinococcus, Emerococcus, Gemella, Lactobacillus, 
Lactococcus, Leuconostoc, Marinococcus, Melissococcus, Micrococcus, 
Pediococcus, Peptococcus, Peptostreptococcus, Planococcus, Ruminococcus, 
Saccharococcus, Salinococcus, Carcina, Staphylococcus, Stomatococcus, 
Streptococcus, Trichococcus, or Vagococcus. 



10 



8. The method of claim 7 wherein the gram-positive cell wall surface protein is 
15 the type-6 M protein of Streptococcus pyogenes. 

9. The method of claim 7 wherein the gram-positive cell wall surface protein 
comprises: 

an N-terminal signal sequence comprising a complete leader sequence 
and a number of amino acid residues from the N-terminus of the mature 
protein; 

a C-terminal sorting signal comprising the LPXTGX motif, wherein X i 
any amino acid residue, residing within a proline/glycine rich region; 
a hydrophobic membrane spanning domain; and 
a charged tail. 
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ro^The method of claim 9, wherein the gram-positive ceU waU surface protein 
additionally comprises a DNA fragment having a recognition sequence for at 
least one restriction endonuclease (the multiple cloning site, MCS) located 
between the DNA sequences that encode the amino- and carboxy-terminal 
5 sorting signals. 

11. The method of claim 1 wherein the plasmid vector further comprises at least 
one cleavage site for a proteolytic enzyme. 

12. The method of claim 1 wherein the protease is the proteinase of tobacco etch 
virus (TEV), thrombin, enterokinase, or factor Xa. 
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13. The method of claim 9 wherein the plasmid vector additionally comprises a 
variably sized protein spacer or linker sequence that physically displaces the 
fusion protein away from the cell surface. 

14. The method of claim 13 wherein the linker sequence is incorporated between 
the protease recognition sequence and the C-terminal sorting signal domain. 

15 15. The method of claim 1 wherein the cleaved proteins are purified in a single 
step througli the use of affinity tags. 

16. The method of claim 15 wherein the affinity tags are polyhistidine tracts 
containing from 6 to 10 histidine residues, the IgG binding domain of protein 
A, or glutathione S-transferase. 

20 17. The method of claim 1 wherein the cleaved proteins are purified using metal 
chelation, protein A-Sepharose, glutathione-Sepharose column 
chromatography, or a combination thereof. 
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18. The method of claim 1 wherein the plasmid vector additionally comprises 
extrachromosomal genetic markers that are sufficient to provide a selective 
advantage to a host expressing said genetic markers. 

19. The method of claim 18 wherein the genetic markers are antibiotic genes, 
5 heavy metal resistance genes, and nutritional determinants. 

20. The method of claim 18 wherein the genetic markers are capable of being 
expressed in both E. coli and gram-positive hosts. 

21. The method of claim 18 wherein the plasmid vector additionally comprises a 
polyUnker or multiple cloning site which includes restriction endonuclease 

10 cleavage sites. 

22. The method of claim 7 wherein the plasmid vector comprises sequences 
coding for the M6 ribosome binding site, the translation initiation codon 
(ATG), the 42 amino acid leader sequence (-42 to -1), and a portion of the 
mature N-terminus which is at least 5 amino acids long. 

15 23. The metiiod of claim 22 wherein the portion of the N-terminus coded for 
comprises residues -42 to +16. 

24. The method of claim 1 wherein the C-terminal sorting signal is derived from 
the M protein of type-6 S. pyogenes. 
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25. The method of claim 1 wherein the plasmid vector additionally comprises the 
M6 C-terminal residues 302 to 441, termination codon (TAA), and 
downstream sequences which comprise a transcriptional terminator. 

26. The method of claim 1 , wherein the plasmid vector is pSPEXla. 
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27. The method ol claim 26, wherein the pSPEXla vector additionaUy comprises 
a protease which has a cleavage site of greater than 4 amino acid residues. 

28. The method of claim 26, wherein the protease is the TEV NIa proteinase, 

which cleaves a specific consensus cleavage site which spans the seven amino 

5 acid sequence E-X-V/I/L-Y-X-Q*S/G, wherein X can be any amino acid 

residue. 

29. The method of claim 28, wherein the cleavage site is E-N-L-Y-F-Q*G. 

30. The method of claim 28. wherein the cleavage site is located adjacent to the 
proline/glycine-rich region harboring the LPXTGX motif. 

10 31 . The method of claim 28, wherein a protein spacer is added between the TEV 
proteinase recognition sequence and the proline/glycine-rich anchor region. 

32. The method of claim 31, wherein the spacer is derived from the M6 protein 
and consists of amino acid residues 222 to 301 . 

33. A method for expressing a desired protein in gram-positive bacteria, 
15 comprising: 

selecting a heterologous, gram-positive bacterial host; 
introducing into the host a plasmid vector comprising 
DNA fragments encoding 

amino and carboxy terminal sorting sequences of a gram-positive 
cell wall surface protein, and 
the desired protein; and 

transcriptional and translational control sequences that function in the 
chosen host; 
expressing a fusion protein comprising 
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the-desirediJiDtemTana — 

an N-terminal signal sequence 

wherein the expressed protein is secreted from the ceU; and 
purifying the protein product from the supernatant fluid following 
5 secretion. 



34. The method of claim 33 wherein the host is selected from the genus 
Aerococcus, Bifidobacterium, Corprococcus, Deinobacter, Deinococcus, 
Emerococcus, Gemella, Lactobacillus, Lactococcus, Leuconostoc, 
Marinococcus, Melissococcus, Micrococcus, Pediococcus, Peptococcus, 

10 Peptostreptococcus, Plamcoccus, Rwninococcus, Saccharococcus, 

Salinococcus, Carcina, Staphylococcus, Stomatococcus, Streptococcus, 
Trichococcus, or Vagococcus. 

35. The method of claim 33 wherein the host is from the genus Streptococcus. 

36. The method of claim 33 wherein the host is Streptococcus gordonii. 

15 37. The method of claim 33 wherein the host is Streptococcus gordonii V288 
(Challis), GP230, GP232, or GP251. 



38. The method of claim 33 wherein the host is deficient in extracellular protease 
production. 



The method of claim 33 wherein the gram-positive cell wall surface protein 
is derived from the genus Aerococcus, Bifidobacterium, Corprococcus, 
Deinobacter, Deinococcus, Emerococcus, Gemella, Lactobacillus, 
Lactococcus, Leuconostoc, Marinococcus, Melissococcus, Micrococcus, 
Pediococcus, Peptococcus, Peptostreptococcus, Planococcus, Ruminococcus, 
Saccharococcus, Salinococcus, Carcina, Staphylococcus, Stomatococcus, 



wo 96/40943 



PCT/US96/09965 



-29- 

Streptococcus, Trichococcus, or Vagoco(xm. 

40. The method of claim 39 wherein the gram-positive cell wall surface protein 
is the type-6 M protein of Streptococcus pyogenes. 

41. The method of claim 39 wherein the gram-positive ceU wall surface protein 
comprises: 

an N-terminal signal sequence comprising a complete leader sequeice 
and a number of amino acid residues from the N-terminus of the mature 
protein; 

a C-terminal sorting signal comprising the LPXTGX motif, wherein X is 
any amino acid residue, residing within a proline/glycine rich region; 
a hydrophobic membrane spanning domain; and 
a charged tail. 

42. The method of claim 41, wherein the plasmid vector additionally comprises a 
DNA fragment having a recognition sequence for at least one restriction 

15 endonuclease (the multiple cloning site, MCS) located between the DNA 

sequences that encode the amino- and carboxy-terminal sorting signals. 

43. The method of claim 41 wherein the expression vector additionally comprises 
a variably sized protein spacer or linker sequence that physically displaces 
the fusion protein away from the cell surface. 

20 44. The method of claim 43 wherein the linker sequence is incorporated between 
the protease recognition sequence and the C-terminal .sorting signal domain. 

45. The method of claim 33 wherein the plasmid vector additionally comprises 
extrachromosomal genetic markers that are sufficient to provide a selective 
advantage to a host expressing said genetic markers. 



10 
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-^6TThe memod of cJaira 45 wherein the genetic markers are antibiotic genes, 
heavy metal resistance genes, and nutritional determinants. 

47. The method of claim 45 wherein the genetic markers are capable of being 
expressed in both E. coli and gram-positive hosts. 

48. The method of claim 45 wherein tiie plasmid vector additionally comprises 
polyiinker or multiple cloning site containing restriction endonuclease 
cleavage sites. 

49. The method of claim 39 wherein the plasmid vector comprises sequences 
coding for the M6 ribosome binding site, the translation initiation codon 
(ATG), the 42 amino acid leader sequence (-42 to -1), and a portion of the 
mature N-terminus which is at least 5 amino acids long. 

50. The method of claim 49 wherein the portion of the N-terminus coded for 
comprises residues -42 to +16. 



51. The method of claim 33 wherein the vector is one of the SPEX2 series of 
15 vectors. 



52. The method of claim 33 wherein recombinant proteins are purified using 
metal chelation chromatography. 

53. The method of claim 33 wherein non-essential N-terminal residues can be 
removed with TEV Nla proteinase treatment. 
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